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NONPARAMETRIC  ESTIMATION  OF  THE  GENERALIZED  VARIANCE 
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Summary: 

For  multivariate  distributions  with  finite  second  order  moments,  a 
nonparametric  symmetric,  unbiased  estimator  of  the  generalized  variance  is 
considered,  and  it  is  shown  to  be  (nonparametric)  optimal  for  the  class  of 
distributions  having  finite  fourth  order  moments.  A  jackknifed  version  of  the 
sample  generalized  variance  is  also  considered  as  a  contender;  it  is 
computationally  more  convenient  and  asymptotically  equivalent  to  the  former. 
It  is  also  shown  that  the  second  estimator  performs  quite  well  (in  large 
sample)  relative,  to  the  optimal  normal  theory  estimators  under  several  loss 
functions.  ,  .  '  ; 
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1.  INTRODUCTION 


Let  x,,...,xn  be  n  independent  and  identically  distributed  random  vectors 
(i.i.d.r.v.)  with  a  distribution  function  (d.f.)  F  defined  on  the  p(M-) 
dimensional  Euclidean  space  EP.  We  assume  that  F  c  J  -  {F:  Ji|x||2dF(x)  z  ®). 
The  parameter  of  interest  (0)  is  the  generalized  variance  |j|,  where 
I  =  Ep{(x  "  Ex)(x  -  Ex)’}  and  (•(  stands  for  the  determinant.  A  good  amount  of 
work  has  been  done  on  the  estimation  of  0  when  F  is  assumed  to  be  a 
multinormal  d.f.;  the  approach  has  mainly  been  decision  theoretic  and  the  main 
result  states  that  the  best  multiple  of  the  sample  generalized  variance  can  be 
improved  on  (in  terms  of  risk)  by  using  testimators  [c.f.  Stein  (1964), 
Shorrock  and  Zidek  (1976),  Sinha  (1976),  Sinha  and  Ghosh  (1986),  and  others], 
although  the  amoung  of  improvement  is  marginal  in  most  cases. 

An  alternative  nonparametric  approach  to  the  estimation  of  0  is 
considered  here.  In  Section  2,  a  symmetric,  unbiased  nonparametric  estimator 
is  derived  and  its  optimality  is  established  through  the  use  of  Hoeffding’s 
(1948)  U-statistics  theory.  A  second  estimator  based  on  jackknifing  on  the 
sample  generalized  variance  is  found  to  be  computationally  more  convenient 
and  asymptotically  equivalent  to  the  former  nonparametric  estimator  of  0.  Like 
the  other  (improved)  parametric  estimators,  the  second  nonparametric  estimator 
also  comes  out  as  a  multiple  of  the  sample  generalized  variance,  and  it 
performs  quite  well  (at  least,  for  large  samples)  compared  to  the  optimal 
normal  theory  estimators  under  several  loss  functions  (vide  Section  3).  Thus, 
the  jackknifed  estimator  seems  to  enjoy  the  parametric  affinity  and 
nonparametric  robustness  in  broad  setup.  The  former  nonparametric  estimator 
is,  however,  somewhat  computationally  involved  .(particularly,  for  large  n). 
Finally,  in  Section  4,  some  concluding  remarks  about  estimation  of  |£|*/P  are 
made. 


2.  MAIN  RESULTS 


Let  x,,...,xn  be  n  independent  and  identically  distributed  (i.i.d.)  random 
vectors  (r.v.)  with  a  distribution  function  (d.f.)  F,  defined  on  EP,  for  some 


p  *  1.  It  is  assumed  that 


F  e  3  =  {F:  /  8x8 *dF(x)  ^  «},  (2.1) 

where  8 ’8  stands  for  the  Euclidean  norm.  Our  goal  is  to  estimate  the 
functional  (the  generalized  variance) 

9(F)  =  | 1(F) |  =  lEp{(x  -  Ex) (x  -  Ex) ’ } | ,  (2.2) 

where  | A  {  stands  for  the  determinant  of  the  matrix  A.  Note  that  when  F  is  a 

multinomial  d.  f .  with  unknown  mean  vector  /i  and  dispersion  matrix  £,  then 

8(F)  =  | F j  is  typically  estimated  by  9  =  |s  |,  where 

n  n 

S  =  (n  -  1)  1  £n  (x.  -  x  )(x.  -  x  )’  and  x  =  -  1°  x..  (2.3) 

n  1=1  l  n  l  n  n  n  1=1  i 

To  motivate  a  nonparametric  estimator  of  9(F),  first,  we  may  note  that 
1(F)  =  EF*o(xitxa),  v  F  *  jF,  (2.4) 

where 

♦0(xi>xj)  -  2  fxi  "  xaHx,  ~  xa)’,  v  xjtx2  e  EP.  (2.5) 


Further,  we  may  note  that 


I  ♦  (x. ,x . ) 
iin  °  l  J 


S  ,  v  n  *  2. 
n 


(2.6) 


Thus,  Sn  is  an  optimal  (symmetric,  unbiased  and  minimum  risk)  nonparametric 
estimator  of  1(F),  F  e  However,  8(F)  is  a  polynomial  function  (of  degree  p) 
in  the  elements  of  1(F),  and  hence,  it  is  easy  to  show  that  |sn|  =  8n  is  not 
unbiased  for  8(F),  although,  the  bias  of  8n  is  typically  of  the  order  n-1.  Our 
first  goal  is  to  consider  a  symmetric  and  unbiased  nonparametric  estimator  of 
8(F). 


Let  xj  =  (x, j,...,Xpj),  j  =  l,...,2p  be  2p  vectors  and  define 


*  x  ,  .  .  .  ,  x  )  = 
i  ap 

I  1  ,  1  w  .  1  , 

"IX  -X  iix  -X  1,  -'X  -x  )  (x  -x  )  ,  .  .  .  ,  -l  x  -  X 

2  ii  12  i  2  2  2:1  2*  i  4  2  pap- 1 


(2.7) 


)  -  x 

pap  ap  1 


X  )  . 

JP 


Note  that  for  the  matrix  (of  order  p  *  p)  in  the  determinant  in  (2.7),  the  jth 


column  depends  only  on  x  .  ,  x  .,  for  j  =  Thus,  for  $(x  ,...,x  ), 

2.J-1  2J  1  2p 

the  p  columns  of  this  matrix  are  stochastically  independent.  Hence,  using  the 


standard  expansion  for  the  determinant,  it  readily  follows  that 


E  ♦( x  , 
F  i 


,x  >  =  <r 

ip  1 


I  =  |£(F)|  =  «(F),  v  F  I  J,  (2.8) 
P 


where  a.  =  \<j  .  .)  stands  for  the  jth  column  vector  of  £(F),  for 

J  »J  PJ 


j  -  l,...,p.  Thus,  ♦(xj,...,x  )  is  a  kernel  of  degree  2p  (although,  it  is 

not  a  symmetric  one),  and,  hence,  following  the  steps  in  Hoeffing  (1948),  a 
symmetric,  unbiased  estimator  of  8(F)  is  obtained  (for  n  ^  2p)  as 
-[  JP] 


U  =  n 
n 


2  ♦  ( x . 

>-i  *. . . *i  £n  1 , 

1  2  p  1 


,x.  ) , 

JP 


(2.9) 


where 


.  I  2  P  J  _ 


n.  . . m  -  2p  +  1 ) :  n 


fp] 


=  ( n 


[  P  J  x  -  1 


(2.10) 


Being  a  U-statistic,  Un  shares  the  nonparametric  unbiasedness  and 
optimality  (minimum  variance/minimum  risk  with  convex  loss  functions) 
properties  when  F  is  allowed  to  vary  over  a  subclass  of  J  for  which  the 
variance  or  the  risk  of  Un  is  properly  defined  (viz.,  F  t  J* ,  where 
J*  -  (F:  J||x||4dF(x)  «•).  Un  is  a  symmetric  function  of  x,,...,xn. 


It  is  interesting  to  note  that  for  p  ^  1,  Un  *  |  Sn  |  =  en,  and 
moreover,  unlike  8n,  Un  is  not  a  sole  function  of  the  elements  of  Sn. 
Thus,  in  the  normal  theory  case  (for  F),  whereas  it  is  possible  to  choose  a 
positive  constant  cn(P  (depending  on  n  and  p  (n  ^  p ) ) ,  such  that  cnip0n  is 
unbiased  for  8(F)  (and  a  symmetric  function  of  the  sample  observations 
too),  cn|p8n  may  not  be  unbiased  for  8(F)  when  F  is  not  normal  and  p  *  2. 
On  the  other  hand,  the  proposed  nonparametric  estimator  Un  in  (2.9),  for  p 
*  2,  involves  the  matrix  Sn  as  well  as  some  other  statistics  (having  smaller 
contributions).  To  make  this  point  clear,  consider  the  simplest  case  of  p  : 
2.  We  have  then 


U 

n 


.2  ♦  '  x  j  ,  X  j  ,  X  j  ,  X  j  ) 

i-i  2 ~ 1  in  1  2  3  4 

1  4 


_ nin-1  1 _ 

n-2 )  ( n- 3 ) 


“A  R 

n  J  n 


(2.11) 
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where 


R  =  n 
n 


•[3] 


I  -[3]  T 

4  i -ixjxk-n 


*xj.xi,xk) 

(2 

2 

( X  .  -  X  .  )  ( x  .  -  X 

•  MX  . 

“  X  i 

11  1 J  11  1 

k  2i 

2j 

(x  .  -  y  .)  ( x  .  -  x  .) 

(x  .  - 

X  , 

U  1J'  21  2  J 

2  1 

2k 

Note  that  R  is  not  a  sole  function  of  S  .  A  very  similar  treatment  holds 
n  n 

for  general  p  *  2.  Whereas  for  E  |  Sn  |  ,  we  need  that  EJ|x||2p  ^  «>,  for 

r 

E  Un(=S(F)),  the  second  moment  suffices. 


To  explore  the  relationship  between  Un  and  0n,  we  note  that  the  von 
Mises’  (1947)  functional  corresponding  to  the  kernel  in  (2.7)  is  given  by 


V  =  /.../  ♦  '’x  ,  ...,x  )dF  (x  )...dF  (x  ) 
n  t  ap  n  i  n  2p 


(2.13) 


=  n  2P.  I  ....  I  ♦(x.  , . . . ,x  ) , 

1l_1  1  2p~  *  1  2P 

where  Fp f x )  =  n  l"=iI(x^  ^  x),  x  £  EP  is  the  sample  d.f.  Using  the 
identity  that 

l  l  (x.  -  x  )(x.  -  x.)’  =  2n  l  (x.  -  x  ) ( x .  -  x  )’,  (2.14) 

j=i  j=i  i  J  l  J  i-ji  n  l  n 

we  immediately  obtain  from  (2.7)  and  (2.13)  that 


n 


n 


_  fnzilp 


I Sn j  =  (1  -  n  )pe 


(2.15) 


Further,  (S^.n  s  p}  is  a  reversed  martingale,  so  that  noting  that  |sn| 
is  a  convex  function  of  Sn,  we  claim  that 

(®  ,  n  i  p  +  1}  is  a  nonnegative  reversed  sub-martingale.  (2.16) 

As  such,  using  the  reversed  submartingale  convergence  theorem,  we 
immediately  conclude  that  en  -»  |l(F)|  =  9(F)  a.s.,  as  n  -*  <■>,  and  hence 

I  Vn  -  »n!  =  0(  n  * )  a.s.,  as  n  -»  ®  (2.17) 


On  the  other  hand,  i f  we  assume  that 

E_<",  x .  ,  .  .  .  ,  x .  ^  ®,  v  1  ^  i  = 

F  l  i  i 

I  2  p 


i  -  L’o,  ( 2 .  IS) 

aP 


L  * 


-  6  - 


then  using  the  results  in  Section  3.2  fviz.  (3.2.9)]  of  Sen  (1981),  it  follows  that 


[ U  -  V  |  =  0(n  )  a.s.,  as  n  -*  » . 

n  n 


(2.19) 


Combining  (2.17)  and  (2.19),  we  obtain  that  under  (2.18), 


| U  -  9  I  =  0  ( n  )  a.s.,  as  n  -»  «> . 
n  n 


(2.20) 


Also,  by  the  reversed  martingale  property  of  U-statistics,  we  conclude  that 

(2.21) 


(U  ,  n  *  2p)  is  a  reversed  martingale, 
n 


Thus,  writing  0  =  U  +  (0  -  U  ) ,  we  conclude  that  U  represents  the 

n  n  n  n  n 


reversed  martingale  component  of  0  ,  while  the  sub-martingale  component 

(0  -  U  )  is  0(n  * )  a.s.  This  can  be  interpreted  as  the  asymptotic  optimality 


robustness  of  ®n(r  I  ^n  I  )  for  estimating  0(F),  for  possibly  non-normal  F.  This 
decomposition  along  with  (2.20)  may  also  be  utilized  in  the  motivation  of  a 
.jackknifed  version  of  0n,  which  would  reduce  the  bias  without  compromising 
the  (first  order)  asymptotic  ortimality. 


To  pose  this  jackknifed  version  (0n)  of  0n,  we  define 


Then 


: 

--  (n-1)  1 

n 

l 

,  . . (x  - 

(  l)  w 

X  )  (x 

- 

(2. 

22) 

n-i 

(X-  1 

(«*l)  a 

n-i  a 

n-i 

X(i)  : 

=  (n  -  1)~ 

1 

n 

l  ,  x  , 

for  i  = 

1 ,  .  .  .  ,n 

(2. 

23) 

n-  i 

a 

:=l  («*l)  a 

*  (  i  )  - 

en-i  ' 

for  i  =  1 , .  . 

(2. 

.24) 

0 

-  n@  -  ( n 

l)efl), 

i  =  1,... 

,n. 

(2. 

.25) 

n,  l 

n 

n-i 

•S  , 

-i  2  * 

n  I  0 

=  0  +  2=1 

n 

I  i*  ~ 

0(i)). 

(2. 

.26) 

i=  i  n 

,  i 

n  n 

i  =  i  n 

n- 1 

Using  the  results  in  Sen  (1977),  it  readily  follows  that 

- 1 


Ep(0*  -  0'F))  =  o(n  *). 


(2.27) 


We  may  consider  0^  aK  a  competing  nonparametric  estimator  of  0(1’). 


n 

n-i 


Note  that  by  definition 


,(n) 


in-1 ) S  =  (n-2)S  + 

n  n-i 


( x  -  X  )  ( X  -  X  )  . 

n  n  n  n 


so  that 


ls£l  =  S  S.  - 


S„  ~  77  1  W - 9  \  (  X  -  X  )(x  -  X  ) 

n -2  n  (n-l)(n-2)  n  n  n  n 


=  (r“^)P!s  -  — ———"2"  (x  -  x  )(x  -  x  )  ’ 

n-2  n  (n-1)  n  n  n  n 


Thus,  if  we  define 


m 


f  and  note  that 

n 

I  ^  • 

i  =  i  ni 


4)pl 

s  Id  - 

n  ,  —  >  1 

2  (x  -  x  )  S 

n 

(n-i ) 2  n  n  n 

(x.  - 

1 

m 

1  X 

(x.  -x),  i  =  1, . . 

1 

n  n 

in 

n 

=  I 

(x .  -  X 

)’s_1(x.  -  x  ) 

i  - 1 

1  n 

n  1  n 

-1  n 

=  TrfS  V  (x 

.  -  X  )  (  X .  -  X  )  ’  ) 

n  i  =  i 

1  n  1  n 

=  ( n 

-  1)P, 

for  every  n  *  2),  we  obtain  from  (2.26)  and  (2.29)-(2.31 )  that 


e*  =  0  -  n - -  ?  (~ - J)P  e  {1 - D — -  A  .!  +  (n-l)e 

n  n  n  n-2  n  '  ' ' 1 


(n-1)  m 


0  n  _  inzD^l  P 

n  n(n-2)P  i=, 


I  <1 


(n-1)  ni 


2  *  ■  )  +  n  -  1 } 


V1 


(n-1 ) 


p+i 


(n-2)P  H  n 


,  n  -  1  p 

+  P  ( ~  1  9 )  +  n  -  If 


n 


(n- 1 r  (n-l-p ) 
(n-2)P 


e  (1  wn  -  l)fl  - 
n  ! n-2 ) P 


i  H  ,  In-im  -  si® 

n  n- 2  n-  1 


cn , p  en<  say. 


(2.32) 


Note  that  for  p  =  1,  c*(  ,  -  1  while  for  p  *  2,  <*, ,  p 
Thus  we  have 


IV.  r  1 . 

•i  ,,  ■’ 


0  n 


9*  "  "* n  I  ~  0(n  1  fi-s.,  as  n  -»  ®. 


(2. ‘A3) 


This  jackknifed  estimator  (»n>  belongs  to  the  class  {c  t»n)  of  adjuster 

n,  p 

estimators  which  have  been  studied  extensively  by  Sinna  and  others  ([4],  [5], 


[6]).  It  may  be  of  interest  to  compare  this  jackknifed  estimator  with  some  of 
the  other  ones  (in  terms  of  the  (asymptotic)  mean  squares)  when  F  is  normal. 
This  will  cast  light  on  the  (near)  optimality  of  8*  for  normal  F.  Details  appear 
in  Section  3. 


Recall  that  by  (2.25),  (2.26)  and  (2.29), 
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Therefore,  we  have 
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(2.32)  and  (2.35)  can  be  used  to  construct  an  asymptotic  noriparumetrir 
confidence  interval  for  8(F). 


3.  COMPARISON  WITH  OPTIMAL  NORMAL  THEORY  ESTIMATORS 

Whan  F  is  normal,  best  estimators  of  |][|  of  the  form  cn)P  0n  are  known 
for  various  loss  functions.  For  example,  the  optimum  choice  of  entP  is 
(n-p)!(n-l )P/n!  =  1  +  p(p-3)/2n  +  0  (n~2)  for  each  of  the  three  losses: 

L,(|Zl,  III)  =  (III  -  ill  )2,  La(|zl,  III)  =  (lll/lzl-  1>2  and  L3(|Zl,  III)  = 

(IZl/IZl)  -  ln(|X|/|Xl)  -  1.  The  risk  of  cn(P  9n  under  L,{*>  is  easily 
computed  as 

(3.1) 
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where  v  p)  and  y  (pi  depend  on  p.  Comparing  the  rises  of  ' 0n  and 
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c*  8n,  one  gets 
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n 

The  result  under  L2(-)  is  obtained  from  (3.2)  by  dropping  the  term  III2.  For 
the  loss  function  L3(')>  the  risk  of  cnip9n  is  obtained  as 

c  ‘1  -  p(p-.ll/2n)  -  In  c  +  c  ^  (p)/n2  ^  V'  (p,n)  +  o(n  2)  (3.3) 
n ,  p  n ,  p  n,p  3  11 

where  y, ! p)  depends  on  p  and  >-.»<' p,n)  depends  on  both  p  and  n.  A 
comparison  of  the  risks  of  ®n  anc^  ®n  immediately  gives 
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(3.4) 


It  follows  from  (3.2)  and  (3.4)  that  9*  performs  quite  well  for  normal  F  even 
for  moderate  values  of  p. 

It  is  clear  from  (2.9)  that  the  genuine  nonparametric  unbiased  estimator 
Un  of  9(F)  is  somewhat  difficult  for  computation.  However,  the  other 
competing  asymptotically  unbiased  (up  to  o(n-1))  nonparametric  estimator  9^  ‘s 
easy  to  work  with. 


4.  CONCLUDING  REMARKS 


It  is  interesting  t<>  point  out  what  happens  if  we  consider  the  problem  of 
estimation  of  |  £  j  1  -  »  (say)  under  the  nonpararnetric  setup.  First,  it  is 
clear  that,  unlike  in  the  previous  problem,  here  a  kernel  which  can  be  used  to 
construct  an  unbiased,  symmetric,  nonpararnetric  estimator  of  »  is  not 
available.  This  observation  automatically  justifies  the  obvious  utility  of 
jackknifed  estimators.  Second,  one  may  proceed  to  work  with  the  jackknifed 
version  of  .®n  ~  n  wher  e  L  n  is  the  symmetric,  unbiased  estimator  of  |  £| 
defined  in  (2.9).  Various  asymptotic  properties  of  this  nonparametnr  estimator 
are  readily  available  in  the  form  of  general  functions  of  IJ-stat ist ins 
containing  Ur,  *  ns  a  special  ease'  in  Sen  <1977;,  Section  3.  Third,  it  is 
also  possible  to  use  the  jackknifed  version  of  the  parametric  estimator 
j  Sn |  P.  Using  the  relations  (2. 29)- (2. 31 ) ,  it  is  easy  to  verify  that  this 
version  results  in  j  Sn |  P  itself  at  least  for  large  n.  Finally, 
we  note  that,  computationally  |  Sn  |  P  much  simpler  than  the  jackknifed 
version  of  ut,  P  . 
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